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LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) N^ME: NOVAR3TS AG 

(B) STREET: Schwarzwaldallee 215 

(C) CITYABasel 

(E) OTUNTrA Switzerland 

(F) POSTAL COSE (ZIP) : 4058 

(G) TELEPHONE: \41 61 69 11 11 

(H) TELEFAX: + 41\61 696 79 76 

(I) TELEX: 962 991 
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(ii) TITLE OF INVENITCN: 
corrpounds 



its in or relating to organic 



(iii) NUMBER OF SEQUENCES: 33 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) CCMTOIER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, 



sion #1.25 (EPO) 



(2) INFORMATION FOR SBQ ID NO: 1: 



(i) SEQUENCE CHARACTKRISTICS : 

(A) LENGTH: 6695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: unknown 



(ii) MDLECULE TYPE: ENA (gencanic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Daucus carota 

(lx) FEATORE: 

(A) NAME/KEY: CDS 

(B) LOCATICN: 3696 -.6617 

(ix) FEATURE: 

(A) NAME/KEY: outran 

<B) LOCAITCN: 3731. .3802 

(ix) FEATORE: 

(A) NAME/KEY: intran 

(B) LOCATICN: 3851. .3979 

(ix) FEATURE: 

(A) NAME/KEY: intran 

(B) LOCATICN: 4124.. 4211 

(ix) FEATURE: 

(A) NAME/KEY: intran 

(B) LOCATICN: 4284. .4357 

(ix) FEATURE: 

(A) NAME/KEY: intran 

(B) LOCATICN: 4430.. 4528 

(ix) FEATORE: 

(A) NAME/KEY: intran 

(B) LOCATICN: 4642.-4757 
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(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCA3TCN: 4890.. 4967 

(ix) FEATORE: 

(A) NAME/KEY: introm 

(B) IjOCATICN: 5295.. 5803 

(ix) FEATORE: 

(A) NAME/KEY: intran 

(B) LOCATION: 6197.. 6339 

(xi) SEQUENCE DESCRIPTION: SEQ 3D ND: 1: 

TCTAGATGAC GAAATCGCGC TAOCTTIGAT TTNGAAATAC TAGGTTGTAG TATCTIGATT 60 

AGTnnTGG ATATCTTGCT GTAA LLT1U1T TAGGAGATGC AAACGGTCTT CATTTAATAT 120 

GAGCCCTK5T GACTTGACAA AAGTATCTAG CATCTTIGAT CACGAQGTAG CTAAAAACTA 180 

GCGTGrlTIGA TTAAGCACAT AAIMTGTAT TQGGCCTA3T GGCTATCAAT GAACJITIGAT 240 

GCAAGTATAT AGCTTCTATT ATGCATGTIGA TGAQQCTATA TAAAAGAAGTT AAAGAACATT 300 

CTCTCGTAGC ATICAnTTT CTCTIX3CCTA TAGTTAACGA GTCTriGTCAC ACATCACGTT 360 

GAAACTGGAT (JimVlUllC TTCCATCTAA GTTTQGAIITA CCTGATAGAT GCTCAACTIC 420 

TICGTCAGCC ITTTCTTrCC GAiriTTCCC AAGACAAGAT TCTTTAGTTA ATAGfTEATIG 480 

CTCTGGTOQC TTGTCTQCAT TTIAGGAATC TTACTCTGrlT TTTTAATQGA GAAACGAAAC 540 

CTftO LTlTlT TTCTOTGnTC CCTTTEATGA TATCACCTGC TTGGAGGCGT TTAGACTTTA 600 

TCCADCTAAA CTAHTCATOT TTACCAGACA AGCTAXAOGT TTTATCCCCC OOOOOOOOQG 660 

ACCTGN3GAC AAAAGAAGCG CTGATGAACT GA2TIAAT0C CTGTTCTATT ATAITACACA 720 
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TIGATOCnC ATOGAGCTAA TATCTTIGGT TAAATTTCAT GTIAIAIATAT ACCCTICCCT 780 

C 1 TCI G ATOG CAGT3GCCCC TCGTTEAA3T AGCGTACTTA ATTAUCTGAT GGAXACICTA 840 

TCCTIX3GCAG ATCATCECAT CAGMTAIAC CMTICTICT GCICTACAAA AIAAAAAACC 900 

TCTATTIATG TTCMCTTIT TQCTAACAAG TAACTAATIG ATGOGCTATG TTGACAGGGG 960 

ATCCATTACA CAACITACGA ACTAGCTIQC AAGATCCCAA CAATCTCCTC CAGAGCTGGG 1020 

ATCCAACCCT TGTIGAACCCT TGCACATCGT TTCATOIGAC ATCTAACAAT GAAAACAGTC 1080 

TTATAAGAGT CTAGGTCACT TCCOTTATTA ATTTTTTIAG CAACTEAGGA AKOTTACTC 1140 

AATTGAGCAG ATGTCTCITT AAATA3TTIT CTTTAATTTC TTAGCTAAGC QGAGCATCTA 1200 

TCTTAAGTAT CTCTACTSAA TTTAAGACAT AATACA3TTT TTTAAAAAAT CTATTAGAGT 1260 

GMTI ' ITI CCG CACAGCGCAC ATATATCTIT TTTCTOCTAA TTCAGACAAC CTITCTCCCG 1320 

ACGATAAAAT AATATAAGAT TAACTCCTTG AACTAATTTT TTATTTTTCT TITCTITTTA 1380 

ICTTCnTGC AGAAAGTCTIC TTATQGTCTT 1TGTTGAAAAG TACAT1CEAT GATAATTTIT 1440 

TGGCAACTCA TATAAA3TTA TATATATTCC ATGTTAGTTAT AAGTCTAAAAA AAGCTTCOTA 1500 

TTAATTCCAA GATAGAGGTIT CAHTl'lTATA GTITQGGCAT CCATCAGTIT TTGAAAATGT 1560 

CAGAAATTIT GTTGAGrlTAA TTTTACTEAC CAACTTTIAT GGCX3TCATGC AGTGATCTTG 1620 

GGAATGCAGC ATTATCTOGrT CAATTOGrTTC CTCTPQGCCA GTIGAAAAAT TTACAATACT 1680 

TGTAAGACCA TATCACTTGG AATGCTTIAG TITl'iMCACA GCACAATGCT TTCAATATCT 1740 

CTTAAAAGTG TGAAAAAGTCT GACTTTOTAG CTICAGCAGT TGTTOQGAXA ATATCTATGA 1800 
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AGCACTEAAA AGGCTQQGCA AITITITIUT TATTATITCA AATATTGTIA ATK7ITACTA 1860 

CTEAATATCA TAAACTGMT TAACICCTCA TGATTGCTCT CAGTCCAATG TQCCCTCATT 1920 

AGTCACAINA TAAAATTOGN GQGTIQGACA AATATAACTT CTTTTCTEAA GGTICCAGAAA 1980 

GAGCACTTAT CAACCTTGTC TAGQGCATAA CGTCACACTG GGTCAGTTCAC GQGCTATCCA 2040 

CTTT3GGGAG GITITAATGA GCACTTATTT ACCTTCfPCTT TTAAAOCTCT GAGGATGFTTA 2100 

TTAAAGTCTG CATOVITCAG AGITEAAAIIT AGCACTTTCA CTICTAITAT GAATGGTEACA 2160 

TGAAAGATAC ATATCTIAAT GTTCCTATGC CTGfTTTCAAC ATOTCTC7EAA TA3TCTGTTA 2220 

TCTITCTCAT CTTAAAAATG GCACTGATTA AAA1KJPGAGA AAGCTMTICT TCX^ATACCA 2280 

TTICATGTAT ACCAGAGAAT ATCATAATIT TTITAAATCA TAACTTQGGC CCTAGAGTTT 2340 

TCTCAGTATT GGTCTATTTA TATTTTCCAC CA3TTAGAAC TGTGTTGTCA GATCAAAATC 2400 

TTQGACTTCC ACAGAAGATC TTAIAGTAAA AGTATTCTIT AGATCTGATC ATCAAAGTTG 2460 

TCATGGTGTG GCCTGTCCCA GAATTTAAAT CAATCCCATG TCACATOTTT GTTGATCIGA 2520 

CTACTCACTG TTAATCGAAG ACTAACTATT TGTCAATTAA ATOCTnTIT TTTTGTTCTT 2580 

CATGCTTAGC GTTATAAAGG TCTACGTCTC ACTATOGTTT TTAACATGTT ATAGTTITGT 2640 

ACTGACAACT TTAAAGMTIC TCTTGTTTAC GAA3TAAGAA TATATAATAT AAAACGCTTT 2700 

AACTTTCTCT GTOQAAOGTG TICTTACCIT TTTATATATA TATATAGAIA CTCAGACTCT 2760 

GCTGGCAATT ATATCTTACG AACTTACGAG TATACAGAAC TTCTATAITA GGTTCAGATG 2820 

AGTGGCTGTTA GTAGAACACC TTAAGCAAGA ACITAATCAT GAGGmTCAA OCTTTTAACT 2880 

TTCTTITEAG ATTTTTICAA GTTXATOGAA AAITGTEACCT CATGATCCTG CnTTCTTTCC 2940 
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ATAAACITIC CAIATAAGTCC CX7ITICITCA CGTITITCATG TAAGCTCITC AOGAGTIGATT 3000 

ATTAGCGCTT CTTICAATAA TCATAATGTG TCTCACTTPG AIGAGGOC1G TACTEMTAT 3060 

TCCACCTTGC ACTTAACCTT GATCCTCATC TCATCTIGAT TGTICATAGrTC TACTAACCGA 3120 

GTITCAACATC GTITIATCATG TCTITIGAGG TAACAATCTA GCTTTCACCT CIGTCCITGK 3180 

TATAGGTITA AGGCITGCAC CTCCCACTAG CCTITOGnG T1T1MTCAC AGTTCACACA 3240 

CCTACTAGCA CTGTTCACCT CTAGTCTTTT GTCCQCAAAT ACTAAGAAGT TTCTITOGCA 3300 

TAATAGTGGA TGATCATTTA AGAAATAGTG AATCAAATEA TCGTIGTTATT GTGmTGTAC 3360 

TTTOGAATTA AATCAGTITGC TGAACATTGT TOCTGTTITAT CGTPGTCAAG GCTTTGCCAA 3420 

GGAAGGCGAT TAGTAAGAGTT GGGCATCCAA GCGCCTTTAT CTTGAAGGGG OQGGCGGCAC 3480 

GTTCTGGATT CTGGGTGTTCT A3TAGAGGAC ATEA3CEATA TA3ACTGATT ATTTATTAGA 3540 

ATATAAATCA ACTACTATAT TITTCTTTGT AATGriTIATA TAGAAATCCC ACTCGTAAAC 3600 

TTGACAAATA CCATPGAAAT A3TTGAACCT AATTAATTAG TAGTTGTCAGG TTTAAATTCA 3660 

AACTCATTTA A' lTlTA CTTT AAAAAATAAT TCTATATCAA TCGTAACAGT ATAAATATAT 3720 

TAAATTACAT CTATGTGTCC CTATATATAG CTGAATGTCT AATAGACTCC AAGAOQGCTG 3780 

CTCTTACTGC CTAGGCGTCC AGGCAGTTCA CTGATGCITA CCTTGACAAA TATQQQGTITC 3840 

CTAIGACATT GTTQQQGATC CCTATCACTG GA3TCCTGTT TTOCTGACCC TCTGTTCAAT 3900 

TGATITICAT TSATCTAGTA TTACTAGTTT TAIAAAIATT CTTTATTQCA ATAATTTAAC 3960 

TGGAGOTEAA CAATCACAGG GAGCITTACA GCAATAACAT AAGTOGACCA ATTCCTAGTG 4020 
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ATCTIGGGAA TCTGACAAAT TTGGTIGAGCT TQGAOCTAIEA CAIGAAIAGC TTCTCTGGAC 4080 

CTAEACOGGA CACATTAGGA AAGCITACAA GGCEAAGATT CTIGTATGAC TACAAATCTT 4140 

CACTA LTITIT TAACTTAATC CAAITIGATr ATCCTTTCAA GTGATTCATT ATATCACAAA 4200 

TTACTGGAIA GGOGTICTCAA CAACAACTOC CTCTCTOGTTC CAA3TCCAAT GTCACTGACT 4260 

AATATIACAA CICITCAAGT OCTCTAAGTEA TICCGACCTT TCCAGATAGT TTTGTTGTTG 4320 

TCGATOTTIC AATTTTAATA CEAAAXATGT TCATCAGGGA TTEATCAAAC AATCGGCTAT 4380 

CAGGACCAGT ACCGGAIAAT GGCTCAIITIT CTITGTTEAC ACCTATCAGG TTEAATGCTA 4440 

CTAATATCTT TAATATTATG GTITCTTACTT CTACTGCGAA AGCTATGAIA AIATinTTT 4500 

TCTCCTTCAT ATATTATCAC TTTOGCAGTr TTOGCAATAA TTTSAATTEA TCTQGACCTG 4560 

TAACTGGGAG GCCCTGCCCT GGATCTCCCC CATITrCTCC ACCACCTCCG TTCATCCCAC 4620 

CATCAACAGT ACAGCCTCCA GGTGATTTAG TTTTEAIA2T AATTCCCGTA ATEAAITnA 4680 

TCACTCTAAA AATTGCTGrTT AATTTCACCA GTTGCGAATA AAGTATTITC CITCTTTCTC 4740 

TTCTTATEAT TATGAAGGAC AAAATGGTCC CACTOGAGCT ATTGCTQQQG GAG7EAGCTGC 4800 

TGGTCCTCCT TTACTGTTTG CTGCACCT3C AATQGCATIT GCATQGTOGC GGAGAAGAAA 4860 

ACCGCGAGAA CAnTCTTTG ATCTGCCAQG TIAGTCCTGT AAATAGATAT CEA3TGAAGC 4920 

GCTEACTGTC TGTQGACTTT GTITICACTG TCATTAGTTA ACTTCAGCTG AAGAGGAOOC 4980 

AGAAGTIGCAC CTIGGTCAAC TCAAGAGGTIT TTCTCTGCGA GAATCGCAAG TCGCAACGGA 5040 

TACTTITAGT ACCA3CCTTG GAAGAGGTOG ATTT3GTAAG CTG7TATAAGG GACGCCTK3C 5100 

TGATQGCTCA LTIGI AGCAG TTAAAAGGCT TAAAGAAGAA CGAACAOCAG GTQGCGAGCT 5160 



-45- 



GCAGTITCAA ACAGAAGTQG AAATGATTAG CMGGCTOIG CATCGAAATC TTCTGCGTCT 5220 

ACCTGC?ITrc TCCATGACAC CTACCGAGCG GCrTCTTCTA TATCCAIACA TQGCTAATGG 5280 

AACTCTIGCG TCATGmTAA GAGGTATCTC AGTTACAATT ACCAIAACIT GCCAGAAGTT 5340 

TCTTIGAITA AAAATCAAAT ATAACTCCCT ACACTATOIT AAGGTICTTAT AATITCTGAG 5400 

CAGATCTTAT TTCCCATTGC AAGATACCAG TTA3TAT1CT TTTTTOTGrrA ATCGAIACCG 5460 

GTITATAriTC TTICTTGrlAT TTQGTTATAT GCAAGGATCT CGAGTCTAAT AAGTTATCAA 5520 

ACTGGATGCT ATGTTTATIC TGCAATTGAA TTCTTGCTIC ATCTGCCAAA AIAIATATGA 5580 

TTCAACTTQG AATCATCTIA TAATATACTG TGTCAAAGTCA GCToTTGACT TTCATCATIA 5640 

ATTAGTCTIC ATAAATCAGA ATCTGCCTAG TGAGCTTTAC OGACATACTC TAAACCTTCC 5700 

TTATGGCCCT CTATATAATC GTCCCACTTA CTITA1TCAG TITCXCTGCT CTCTGAA3TT 5760 

TTGATCTGTA CATTGTGATG TLTlULTril ' ATCAAATGTA GAGCGfTCAGC CATCAGAACC 5820 

TCCCCCTGAT TGGCCAACTA GQGAGAGGAT TGCACTAGGA TCTTCTAGGG GCCTATCTAA 5880 

ATTGCATGAC CATTGTGASC CCAAGATTAT CCATCGCGAT CTAAAAGCTG CAAATATATT 5940 

ATTGGACGAA GAATTTGAGG CTGTTGTAGG TGATTCTGGG TTAGCTAQGC TCATOGATTA 6000 

CAAGGATACC CATGTITACGA CTGCTGTTAAG QQGTACCATT QQQCACATAG CTCCCGAGTA 6060 

CCTCTCGACT GGAAAGTTCAT CAGAGAAGAC CGATCTCTTT GGTTTATGGGA TAATCCTCCT 6120 

AGAGCTCATT ACTGGACAGA GGGCTnTGA TCTTGCTCGC CTTGCGAAOG AIGATCATGT 6180 

TATGTIXjITG GATTGQGTAT CTGTCCCQQG TGTTOCrTTG GTTAATTATT TCACATATTA 6240 
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GIQCTEACEA ClTllrfl ' lUlfc GOOCnTGTT ITiKlTlULT GUL'IUIMTI 1 GAITCTTAGT 6300 

CAICTTATGC ATATTX3ACCT GCTTTGCAAT GTCTTTTAGG TTAAAAGCCT TTTGAAAGAG 6360 

AAAAACTTGG AGATGCTQGT OGATCCTGAC CTGCAGAACA ATTACA1TGA CACAGAAGTT 6420 

GAGCAGCTTA TTCAAGTAGC AITACTCTCT ACCCAGGGTT OQCCAATQGA GCQGCCTAAG 6480 

ATGTCAGAGG TAGTCCGAAT GCITGAAQGT GATQQOCTTG CAGAAAAGTG GGACGAGTQG 6540 

CAAAAAGTTG AAGTCATCCA TCAAGACGTA GAATTAGCTC CACATOGAAC TTCTGAATOG 6600 

ATCCTAGACT CGACAGATAA CTTGCATGCT TTK5AATTAT CTQGTCCAAG ATAAACAGCA 6660 

TATAAAATCT AATGAAATTA ATAI'ITITIA TGGTT 6695 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cENA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Daucus carota 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 94.. 1752 
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(xi) SEQUENCE DESCRIPTION: SEQ ID ND: 2: 

GACAAATACC ATTGAAATAT TTGAACCTAA TTAATTAGTA GTTCTCAGCTT TAAATTCAAA 60 

CTCATTTAAT TTIACTTTAA AAAATAATIC TAT ATG AAT OCT AAC ACT ATA AAT 114 

Met Asn Arg Asn Ser lie Asn 
1 5 

ATA TTA AAT TAG ATG GAG TIC ACT GAT GCT TAG CTT GAC AAA TAT GGG 162 
He Leu Asn Tyr Met Gin Phe Thr Asp Ala Tyr Leu Asp Lys Tyr Gly 
10 15 20 

CTT CTT ATG ACA TTG GAG CTT TAC AGC AAT AAC ATA ACT GGA CCA ATT 210 
Val Leu Met Thr Leu Glu Leu Tyr Ser Asn Asn He Ser Gly Pro He 
25 30 35 

CCT ACT GAT CTT GGG AAT CTG ACA AAT TTG CTG AGC TTG GAC CTA TAC 258 
Pro Ser Asp Leu Gly Asn Leu Thr Asn Leu Val Ser Leu Asp Leu Tyr 
40 45 50 55 

ATG AAT AGC TTC TCT GGA OCT ATA CCG GAC ACA TTA GGA AAG CTT ACA 306 
Met Asn Ser Phe Ser Gly Pro He Pro Asp Thr Leu Gly Lys Leu Thr 
• 60 65 70 

AGG CTA AGA TTC TIG CCT CTC AAC AAC AAC AGC CIC TCT OCT CCA ATT 354 
Arg Leu Arg Phe Leu Arg Leu Asn Asn Asn Ser Leu Ser Gly Pro He 
75 80 85 

CCA ATG TCA CTG ACT AAT ATT ACA ACT CTT CAA CTC CTG GAT TTA TCA 402 
Pro Met Ser Leu Thr Asn He Thr Thr Leu Gin Val Leu Asp Leu Ser 
90 95 100 

AAC AAT GGG CTA TCA GGA CCA GTCA COG GAT AAT GGC TCA ITT TCT TTG 450 
Asn Asn Arg Leu Ser Gly Pro Val Pro Asp Asn Gly Ser Phe Ser Leu 
105 110 115 



TTT ACA CCT ATC ACT TTT GOC AAT AAT TTG AAT TTA TCT GGA CCC GTA 
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Phe Thr Pro lie Ser Phe Ala Asn Asa Leu Asn Leu Cys Gly Pro Val 
120 125 130 . 135 



ACT GGG AGG CCC TGC CCT GGA TCT CCC CCA TIT TOG CCA CCA CCT COG 
Thr Gly Arg Pro Cys Pro Gly Ser Pro Pro Phe Ser Pro Pro Pro Pro 
140 145 150 
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TIC ATC CCA CCA TCA ACA GTA CAG CCT CCA GGA CAA AAT GGT CCC ACT 
Phe lie Pro Pro Ser Thr Val Gin Pro Pro Gly Gin Asn Gly Pro Thr 
155 160 165 
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GGA GCT ATT GCT GGG GGA GTA GCT GCT GGT GCT GCT TTA CTG TTT GCT 
Gly Ala He Ala Gly Gly Val Ala Ala Gly Ala Ala Leu Leu Phe Ala 
170 175 180 
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GCA CCT GCA ATG GCA TTT GCA TOG TQG OGG AGA AGA AAA CCG CGA GAA 
Ala Pro Ala Met Ala Hie Ala Trp Tip Arg Arg Arg Lys Pro Arg Glu 
185 190 195 
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CAT TTC TTT GAT CTG CCA GCT GAA GAG GAC CCA GAA GTG CAC CTT GGT 
His Phe Phe Asp Val Pro Ala Glu Glu Asp Pro Glu Val His Leu Gly 
200 205 210 215 
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CAA CTG AAG AGG TTT TCT CTG CGA GAA TTG CAA GTC GCA AGG GAT ACT 
Gin Leu Lys Arg Phe Ser Leu Arg Glu Leu Gin Val Ala Thr Asp Thr 
220 225 230 
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TTT AGT ACC ATA CTT GGA AGA GGT GGA TTT GGT AAG GTG TAT AAG GGA 834 
Phe Ser Thr He Leu Gly Arg Gly Gly Phe Gly Lys Val Tyr Lys Gly 
235 240 245 



CGC CTT GCT GAT GGC TCA CTT GTA GCA GTT AAA AGG CTT AAA GAA GAA 882 
Arg Leu Ala Asp Gly Ser Leu Val Ala Val Lys Arg Leu Lys Glu Glu 
250 255 260 

CGA ACA CCA GGT GGT GAG CTG CAG TTT CAA ACA GAG CTG GAA ATG ATT 930 
Arg Thr Pro Gly Gly Glu Leu Gin Phe Gin Thr Glu Val Glu Met He 



-49 



265 



270 
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AGC ATG GCT GIG CAT OGA AAT CIT CTG OCT CTA OCT QCT TIC TGC ATG 
Ser Met Ala Val His Arg Asn Leu Leu Arg Leu Arg Gly Phe Cys Met 
280 285 290 295 
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ACA CCA ACA GAG OGG CTT CIT CTA TAT OCA TAC ATG GCT AAT GGA ACT 
Thr Pro Thr Glu Arg Leu Leu Val Tyr Pro Tyr Met Ala Asn Gly Ser 
300 305 310 
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GIT GOG TOG TCT TEA AGA GAG OCT CAG CCA TCA GAA CCT CCC CIT GAT 1074 
Val Ala Ser Cys Leu Arg Glu Arg Gin Pro Ser Glu Pro Pro Leu Asp 
315 320 325 



TCG CCA ACT AGG AAG AGG ATT GCA CTA GGA TCT GCT AGG GOG CIT TCT 1122 
Trp Pro Thr Arg Lys Arg He Ala Leu Gly Ser Ala Arg Gly Leu Ser 
330 335 340 

TAT TTG CAT GAC CAT TCT GAT CCC AAG ATT ATC CAT CCT GAT CTA AAA 1170 
Tyr Leu His Asp His Cys Asp Pro Lys He He His Arg Asp Val Lys 
345 350 355 

GCT GCA AAT ATA TEA TTG GAC GAA GAA TTT GAG GCT GIT CTA GCT GAT 1218 
Ala Ala Asn He Leu Leu Asp Glu Glu Phe Glu Ala Val Val Gly Asp 
360 365 370 375 

TTT GGG TEA GCT AGG CTC ATG GAT TAC AAG GAT ACC CAT GTE ACA ACT 1266 
Phe Gly Leu Ala Arg Leu Met Asp Tyr Lys Asp Thr His Val Tbr Thr 
380 385 390 

GCT CTA AGG GCT ACC TIG GGC TAC ATA GCT CCC GAG TAC CTC TOG ACT 1314 
Ala Val Arg Gly Thr Leu Gly Tyr He Ala Pro Glu Tyr Leu Ser Thr 
395 400 405 



GGA AAG TCA TCA GAG AAG ACC GAT GTC TTT GCT TAT GGG ATT ATG CTC 
Gly Lys Ser Ser Glu Lys Thr Asp Val Phe Gly Tyr Gly He Met Leu 
410 415 420 
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TEA GAG CTC ATT ACT GGA CAG AGA GCT TIT GAT CTT GCT CGC CTT GOG 1410 
Leu Glu Leu He Thr Gly Gin Arg Ala Phe Asp Leu Ala Arg Leu Ala 
425 430 435 

AAC GAT GAT GAT GIT AUG TIG TIG GAT TGG GET AAA AGC CTT TIG AAA 1458 
Asn Asp Asp Asp Val Met Leu Leu Asp Trp Val Lys Ser Leu Leu Lys 
440 445 450 455 

GAG AAA AAG TIG GAG ATG CIG CTC GAT OCT GAC CTG GAG AAC AAT TAC 1506 
Glu Lys Lys Leu Glu Met Leu Val Asp Pro Asp Leu Glu Asn Asn Tyr 
460 465 470 

ATT GAC ACA GAA GTT GAG CAG CTT ATT CAA GTA GCA TTA CTC TGT AOC 1554 
He Asp Thr Glu Val Glu Gin Leu He Gin Val Ala Leu Leu Cys Thr 
475 480 485 

CAG GGT TOG CCA ATG GAG OGG CCT AAG ATG TCA GAG GTA CTC CGA ATG 1602 
Gin Gly Ser Pro Met Glu Arg Pro Lys Met Ser Glu Val Val Arg Met 
490 495 500 

CTT GAA GGT GAT GGC CTT GCA GAA AAG TGG GAC GAG TGG CAA AAA GTA 1650 
Leu Glu Gly Asp Gly Leu Ala Glu Lys Trp Asp Glu Trp Gin Lys Val 
505 510 515 

GAA CTC ATC CAT CAA GAC GTA GAA TTA GCT CCA CAT CGA ACT TCT GAA 1698 
Glu Val He His Gin Asp Val Glu Leu Ala Pro His Arg Thr Ser Glu 
520 525 530 535 

TOG ATC CTA GAC TOG ACA GAT AAC TIG CAT GCT TTT GAA TTA TCT GGT 1746 
Trp He Leu Asp Ser Thr Asp Asn Leu His Ala Hie Glu Leu Ser Gly 
540 545 550 



CCA AGA TAAACAGCAT ATAAAATGIG AATGAAATTA ATAT1T1T1A TQGITAAAAA 
Pro Arg 
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AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Asn Arg Asn Ser lie Asn lie Leu Asn Tyr Met Gin Phe Thr Asp 
15 10 15 

Ala Tyr Leu Asp Lys Tyr Gly Val Leu Met Thr Leu Glu Leu Tyr Ser 
20 25 30 

Asn Asn He Ser Gly Pro He Pro Ser Asp Leu Gly Asn Leu Thr Asn 
35 40 45 

Leu Val Ser Leu Asp Leu Tyr Met Asn Ser Phe Ser Gly Pro He Pro 
50 55 60 

Asp Thr Leu Gly Lys Leu Thr Arg Leu Arg Phe Leu Arg Leu Asn Asn 
65 70 75 80 

Asn Ser Leu Ser Gly Pro He Pro Met Ser Leu Thr Asn lie Thr Thr 
85 90 95 

Leu Gin Val Leu Asp Leu Ser Asn Asn Arg Leu Ser Gly Pro Val Pro 
100 105 110 

Asp Asn Gly Ser Phe Ser Leu Phe Thr Pro lie Ser Phe Ala Asn Asn 
115 120 125 
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Leu Asn Leu Cys Gly Pro Val Thr Gly Arg Pro Cys Pro Gly Ser Pro 
130 135 140 

Pro Phe Ser Pro Pro Pro Pro Phe He Pro Pro Ser Thr Val Gin Pro 
145 150 155 160 

Pro Gly Gin Asn Gly Pro Thr Gly Ala He Ala Gly Gly Val Ala Ala 
165 170 175 

Gly Ala Ala Leu Leu Phe Ala Ala Pro Ala Met Ala Phe Ala Trp Trp 
180 185 190 

Arg Arg Arg Lys Pro Arg Glu His Phe Phe Asp Val Pro Ala Glu Glu 
195 200 205 

Asp Pro Glu Val His Leu Gly Gin Leu Lys Arg Phe Ser Leu Arg Glu 
210 215 220 

Leu Gin Val Ala Thr Asp Thr Phe Ser Thr He Leu Gly Arg Gly Gly 
225 230 235 240 

Phe Gly Lys Val Tyr Lys Gly Arg Leu Ala Asp Gly Ser Leu Val Ala 
245 250 255 



Val Lys Arg Leu Lys Glu Glu Arg Thr Pro Gly Gly Glu Leu Gin Phe 
260 265 270 



Gin Thr Glu Val Glu Met He Ser Met Ala Val His Arg Asn Leu Leu 
275 280 285 

Arg Leu Arg Gly Phe Cys Met Thr Pro Thr Glu Arg Leu Leu Val Tyr 
290 295 300 

Pro Tyr Met Ala Asn Gly Ser Val Ala Ser Cys Leu Arg Glu Arg Gin 
305 310 315 320 

Pro Ser Glu Pro Pro Leu Asp Trp Pro Thr Arg Lys Arg He Ala Leu 
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325 330 335 

Gly Ser Ala Arg- Gly Leu Ser Tyr Leu His Asp His Cys Asp Pro Lys 
340 345 350 

He He His Arg Asp Val Lys Ala Ala Asn lie Leu Leu Asp Glu Glu 
355 360 365 

Phe Glu Ala Val Val Gly Asp Phe Gly Leu Ala Arg Leu Met Asp Tyr 
370 375 380 

Lys Asp Thr His Val Thr Thr Ala Val Arg Gly Thr Leu Gly Tyr He 
385 390 395 400 

Ala Pro Glu Tyr Leu Ser Thr Gly Lys Ser Ser Glu Lys Thr Asp Val 
405 410 415 

Phe Gly Tyr Gly He Met Leu Leu Glu Leu He Thr Gly Gin Arg Ala 
420 425 430 

Phe Asp Leu Ala Arg Leu Ala Asn Asp Asp Asp Val Met Leu Leu Asp 
435 440 445 

Trp Val Lys Ser Leu Leu Lys Glu Lys Lys Leu Glu Met Leu Val Asp 
450 455 460 

Pro* Asp Leu Glu Asn Asn Tyr He Asp Thr Glu Val Glu Gin Leu He 
465 470 475 480 

Gin Val Ala Leu Leu Cys Thr Gin Gly Ser Pro Met Glu Arg Pro Lys 
485 490 495 

Met Ser Glu Val Val Arg Met Leu Glu Gly Asp Gly Leu Ala Glu Lys 
500 505 510 



Trp Asp Glu Trp Gin Lys Val Glu Val He His Gin Asp Val Glu Leu 
515 520 525 



# 
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Ala Pro His Arg Thr Ser Glu Trp He Leu Asp Ser Ihr Asp Asn Leu 
530 535 540 



His Ala Phe Glu Leu Ser Gly Pro Arg 



545 



550 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEXNESS: single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TlTl ' l ' lTl ' IT TGC 13 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTIH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEINESS: single 

(D) TOPOLOGY: unknown 



(iii) HYPOTHETICAL: NO 



(iii) ANTT-SENSE: NO 



-55- 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

GGGATCTAAG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) T¥PE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

ACACGTQGTCC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TOPE: nucleic acid 

(C) STRANDETNESS : single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 
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(iii) AOTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 7: 
TCAGCACAOG 

(2) INFORMATION FOR SBQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTCH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEXNESS: single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

miTrmT tctg 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEEMESS: single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 




(iii) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

TITTTTTTTT TCA 

(2) INFORMATION FOR SBQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDETNESS: single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ 3D NO: 

GACATCGTCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

CCCTACT3GT 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANIT-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

ACACGTGCTC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SBQ ID ND: 13 

GGTCACTGTC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDETNESS: single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

TCTTQGACCA GATAATTC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTCH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) SIHANDEXNESS : single 
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(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

CTCTGAIGAC TTTCCAGriC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEXNESS : single 

(D) TOPOLOGY: unknown 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

AATGGCAITT GCATQG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTIH: 5 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEXNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SEX$SE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Daucus carota 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Ser Pro Pro Pro Pro 
1 5 

) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: untaown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Daucus carota 

(xi) SEQUENCE DESCRIPTION: SEQ ID ND 

His Arg Asp Val Lys Ala Ala Asn 



• 
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1 5 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENG7IH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Daucus carota 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 19 

Gly Thr Leu Gly Tyr lie Ala Pro Glu 
1 5 

(2) INFORMATION FOR SBQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4081 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEXNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: INA (genomic) 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Arabidqpsis fhaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Arabidqpsis SERK gene 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1280.. 1367 

(ix) FEATORE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1796. .1928 

(ix) FEATORE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2014. .2085 

(ix) FEATORE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2203.. 2346 

(ix) FEATORE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2450.. 2521 

(ix) FEATORE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2617.. 2688 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2772.. 2884 

(ix) FEATORE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3015.. 3146 
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(ix) FEATORE: 

(A) NAME/KEY: exan 

(B) LOCATION: 3305. .3646 

(ix) FEAIURE: 

(A) NAME/KEY: exan 

(B) LOCATION: 3760.. 4081 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCTAGAAACC T1T1G ATCAT AATGAAAATA AAGACTCCAT CCACCACATC GGGTAAGCAT 
AATGTGTGAT ATITAAAGGG TAACAAATGT AATCTGCTTT TTA3TITACT TTTTACCICT 
ACTCAAATTG TATQGGCACT TTITTTTTIT TTTTAAATGA TAAGACAAGTT ATCTG7ITEAA 
TGGTATTCTG ATGAAACAGT AGTAAAGTCA TATCQQGCAC GCCAIACTAC TICCACAGTC 
GAACTTQGCC AAASTTICTC TTTCCCGfTCT CTACAGTTTC TTCCACCAAA Tn-muriv* 
ACAAAACTCA AATCTTTCAA TCTCATCTCT GCCAAAGTCTG GGmTAGAAA GAATATCAGC 
AAACACTAAT A lCl'lT ATTC TTGCATGGTT TATCAATCAC AAAATTCACA ACCATTGTAA 
AAAAAAA3TC AC Ai'l'lTllj G TATGAGATTC CTCACATGAT ACTGAACCTC TTTAACATTT 
TAACTTTACT TTCATAAATA OGGGATTACG AATCTTACTT GCATEAAAAA TTEAGAAAAG 
GTTTTrCTAC TTAAAGAAAA AAQQGACCCA ACAGAGAGAG GITTGACCAG GAGAAACQQG 
TQCATAGCCT TAAGAGCTTT CAACTACTTT ACCCCAAACC CAAAGOGATC TCACTTTCAA 
CCATCTCTIC TCTCCCCOGA ACLXJOT1T1T TTGACCGGTC AGTTTOGGGCA GCAGCACCCT 
TACGGGCAGC TTATA3TCCT OGTCTTCCTC CTCTACACCA CTGCATGCCC ATAAATAAAG 



CCCCTTGAGA TCTITAAAAA TATEAAATAA 
AGAAGAAAAA GAGAGGAACA ACAACAACAC 
OTTGCGGCTT AAIAAAAAGC TCTITTGTIA 
GCICTEATIT TTTICTTTAA AAAAAAAAGT 
GTIXTrcrTGTGA GAGAGAGAGT GTIX3GTITGAT 
AAITAGGATT TTTATTITAT TTITIACTCT 
AAGGCTTATC GAAAAAATGA GIGAGUT1U1 1 
TGGTGMTIT CGGAAGTTAG GGTITICTCG 

Q 

\J TTTAGCATK3 TTCnTIGAAA TGGACTQGAG 

n ^ CTTACTICCG AATCATTCAC TGTQGCITGC 

h 

fT CAATTACTCA GCTTTACTCG TTIdCAATT 

^0 AATCGCTATC TITAGTGTCT GCA3TTTGAT 

GTAAGATITA GTQGCTAGTIA CTTTGAATAC 
TTGTATATTG TAAAGGCA3G TTCTITGGGT 

TTGATGrrror tgatccaaac atictctgaa 

AnTGTTTAA TTA3TAG0CT CTAATCTCAG 
A3TAGACTIT TCACCAAITG ATGCTAATIG 
GCTITGCATA CITIUAGGCT TACTCEAGTIT 
CCTACGCTAG TGAATCCTIG CACATQGOTC 
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TATA3CAADG AAAAAGCTAT TTEATTCATA 840 

ACTAATCATA GITTCTCIGG CAGGCTTOIT 900 

TEAUTACTTC AGGTAGAITT TOCOCAAAAA 960 

TICATCTITA TTCAACTITr GTlTlIACAGrT 1020 

TGAGGAAAGA OGAGGAOGAG AAGGCOGGAG 1080 

TKaTlTbTlT TAATGCTAAT GGtflTlTllAA 1140 

GETGAGGETG TCTCTGEAAA GTGnTAATOG 1200 

GATCTGAAGA GATCAAATCA AGATTOGAAA 1260 

TTATGnQGrro TTTATCTTAC TTPCACTGAT 1320 

TTCTGCTAAT TTGGAAGGTT CGTGGTTACT 1380 

ACTTTCTCGA TIXJI ' ITITIA TTTQGAGGIG 1440 

TTATGAAAAT TtfrTUTlU ' lT CITKrCATTT 1500 

ACTGnTTGC TTTTCITGIT CAGATCAACT 1560 

TGAAAAGCTG GGTIAlTlliA TATCTTAAGA 1620 

AGACTTCATT TGrTCTCTOGT TTTCTAAAGA 1680 

AGAGGOCTGT TTGAATAGTT CTCTCTIGAA 1740 

TGEAGA3TTG TIUITCTIG T TAIAGCTGAT 1800 

GATCCAAACA AICTCTIGCA GAGCTGGGAT 1860 

CATGICACTT GCAACAAOGA GAACAGICTC 1920 
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ATAAGAGICT AAAGCTTTCT TCIACTAATC CCACTTim AACITTGACC TCAG0GTIT3GT 1980 

TACCGACATT 'ITltflTlUT TTCTCAAAIA CAGTIGA3TIG GGQAATOCAG AGTITATCTGG 2040 

CCMTIACTT CCAGAGCTTC GICTGCTCAA GAATTTOCAG TA3TICTAAG TTCCACTEAT 2100 

GCATCATGCT TTAACAAAAC AAATCCAAGA TTIGACAGAA G&AGCACTQG ACTTACCTIT 2160 

TCTAATIGAA ATCTITITAA CAACTTICTT ATTTTCTTAC AGGGAGCTTT ACAGTEAACAA 2220 

CAIAACTGGC CCGAITCCIA GrTAATCTTOG AAATCTCACA AACITACTGA GTTTGGATCT 2280 

TTACTTAAAC AGL T ICTCCG CTCC EA TICC GGAATCATIG GGAAAGCTTT CAAAGCTGAG 2340 

ATTTCTGTGA GTATACATAT QCTTTACOGG CTCAGTITACA C?nUTll*TlT AA3CTTAGCT 2400 

'l'l'lUl T C CAA TTTTIGACTC TTPGCTGAAA ATTTTACATC CAAGAATAGC CGGCTTAACA 2460 

ACAACAGTTCT CACTQGGTCA A3TCCTATGT CACTGACCAA TATEACTACC CTTCAAGTCT 2520 

TGTGAGTCCT CTCATTAACT TTCA3TTATC TCTACTTCAT TCTCCCTCAG TTGMTICTT 2580 

GAGTCTAATGC ACTTAACCTT GATGGATGCA ACACAGAGAT CTATCAAATA ACAGACTCTC 2640 

TGGnTCAGTT CCTGACAATC GCTCCTTCTC ACTCTICACA CCCATCAGGTT TCTATOAITT 2700 

ATCCTCTICA GTTATITCAG Tl t?ni» ' I C a C AGTGTCTGAA CTTATTCTGA AACTITCATT 2760 

TCCTTCTGCA GnTTCCTAA TAACTTAGAC CTATGTQGAC CTOITACAAG TCACCCATCT 2820 

CCTQGATCTC (XXJUbTlTKJ TGX7TCCACCA CCTITEA3TC AACCIXXXXT ALT1T1UJACC 2880 

CCGAGTAAGC lUWllTm 1 TAGTITACAT TAIAGGAAAC AGAAGATGAA ATCTITQCTT 2940 

CTCTGTICAAT IVmTll ' ll ' AIAIAACTCA TCTTCCCAAT AAGGCAAIAA CCAAATGATC 3000 



# 
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TAATTIGATr TCAGGTGGGT A1GGIAIAAC TQGAGCAA3A GCIGGOGGAG TIGCIGCAGG 3060 

TOcrocrnG cicmuji^ ctcotgcaat AGecrrroer tggttogcgac gaagaaagoc 3 120 

ACTAGATATT TTCITCGAIG TGCCTQGTIGA CalTJCATEAUT CQCATTACTT TCTGrTTCTTA 3180 

GCCAGCAATT TTOTTITGCA GAAAAGTA2T GGAACAACTG TTAATGAAAA TCAATACATA 3240 

AfflCATTOIT TnTAAGTEA CAAACTCTTT TGAGTTAAAAT CICGATTGCA AAATCTCTAT 3300 - 

GCAGCCGAAG AAGATCCAGA AGTTCATCTG GGACAGCTCA AGAGGriTITC TTPGCQGGAG 3360 

CTACAAGTTGG CGAGTGATOG GTTTAGTAAC AAGAACATTT TQGGCAGAQG TQGGTTTGGG 3420 

AAAGTICTACA AGGGAOGCTT QGCAGACGGA ACTCTTGriTG CTGTCAAGAG ACTGAAGGAA 3480 

GAGCGAACTC CAGCTGGAGA GCTCCAGrTTT CAAACAGAAG TAGAGA3GAT AACTATQGCA 3540 

GTTCATCGAA ACCTGTTGAG ATTACGAGGT TTCTGTEATCA CACCGACOGA GAGATIQCTT 3600 

GTCTATCC7IT ACATCGCCAA TOGAACTGTIT GCTTCGTCTC TCAGAGCTAA AAACTAAACA 3660 

ATTAAACATC TICTGCTCTC TCTCAATTAC TTIGACGTCA AL7IU1T1T1T CATCTTITOC 3720 

TTTATGGGnT CATAAHTGrTT QGITACACTA AIGACACAGA GAGGCCACOG TCACAACCTC 3780 

CGCTTCATTG GCCAACQCQG AAGAGAATOG CGCTAQQCTC AGCTCGAGGT TICTCTEACC 3840 

TACATGATCA CTGOGATCOG AAGATCATTC AOOjT G AOGTT AAAAGCAGCA AACATCCTCT 3900 

TAGACGAAGA ATICGAAGCG GTTGnTQGAG AnTOGGGTTT GGCAAAGCTA ATOGACIA3A 3960 

AAGACACTCA OGTIGACAACA GCAGTCCGTG GCACCATCGG TCACATOGCT OCAGAATATC 4020 

TCTCAACCGG AAAATCTTCA GAGAAAACOG AQGTTTTCQG AEACQGAATC ATGCTTCTAG 4080 

A 4081 



• 
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(2) INFORMATION FOR SEQ ID ND: 21: 

(i) SEQUENCE CHARALTKKISTICS : 

(A) UEX33IH: 494 amino acids 

(B) TYPE: amino acid 

(C) SraANDECNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: N- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Glu Ser Ser Tyr Val Val Phe lie Leu Leu Ser Leu lie Leu Leu 
15 10 15 

Pro Asn His Ser Leu Trp Leu Ala Ser Ala Asn Leu Glu Gly Asp Ala 
20 25 30 

Leu His Thr Leu Arg Val Thr Leu Val Asp Pro Asn Asn Val Leu Gin 
35 40 45 

Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Thr Trp Fhe His Val Thr 
50 55 60 

Cys Asn Asn Glu Asn Ser Val lie Arg Val Asp Leu Gly Asn Ala Glu 
65 70 75 80 

Leu Ser Gly His Leu Val Pro Glu Leu Gly Val Leu Lys Asn Leu Gin 
85 90 95 



-69- 



Glu Leu Tyr Ser Asn Asn lie Thr Gly Pro lie Pro Ser Asn Leu Gly 
100 105 HO 

Asn Leu Bar Asn Leu Val Ser Leu Asp Leu Tyr Leu Asn Ser Phe Ser 
115 120 125 . 

Gly Pro lie Pro Glu Ser Leu Gly Lys Leu Ser Lys Leu Arg Phe Leu 
130 135 140 

Arg Leu Asn Asn Asn Ser Leu Thr Gly Ser lie Pro Met Ser Leu Thr 
145 150 155 160 

Asn lie Thr Thr Leu Gin Val Leu Asp Leu Ser Asn Asn Arg Leu Ser 
165 170 175 

Gly Ser Val Pro Asp Asn Gly Ser Phe Ser Leu Phe Thr Pro lie Ser 
180 185 190 

Phe Ala Asn Asn Leu Asp Leu Cys Gly Pro Val Thr Ser His Pro Cys 
195 200 205 

Pro Gly Ser Pro Pro Phe Ser Pro Pro Pro Pro Phe lie Gin Pro Pro 
210 215 220 

Pro Val Ser Thr Pro Ser Gly Tyr Gly lie Thr Gly Ala lie Ala Gly 
225 230 235 240 

Gly Val Ala Ala Gly Ala Ala Leu Leu Phe Ala Ala Pro Ala lie Ala 
245 250 255 

Phe Ala Trp Trp Arg Arg Arg Lys Pro Leu Asp lie Phe Phe Asp Val 
260 265 270 

Pro Ala Glu Glu Asp Pro Glu Val His Leu Gly Gin Leu Lys Arg Phe 
275 280 285 



Ser Leu Arg Glu Leu Gin Val Ala Ser Asp Gly Phe Ser Asn Lys Asn 
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290 295 300 

lie Leu Gly Arg Gly Gly Hie Gly Lys Val Tyr Lys Gly Arg Leu Ala 
305 310 315 320 

Asp Gly Thr Leu Val Ala Val Lys Arg Leu Lys Glu Glu Arg Tiir Pro 
325 330 335 

Gly Gly Glu Leu Gin Phe Gin Thr Glu Val Glu Met lie Ser Met Ala 
340 345 350 

Val His Arg Asn Leu Leu Arg Leu Arg Gly Phe Cys Met Thr Pro Thr 
355 360 365 

Glu Arg Leu Leu Val Tyr Pro Tyr Met Ala Asn Gly Ser Val Ala Ser 
370 375 380 

Cys Leu Arg Glu Arg Pro Pro Ser Gin Pro Pro Leu Asp Trp Pro Thr 
385 390 395 400 

Arg Lys Arg lie Ala Leu Gly Ser Ala Arg Gly Leu Ser Tyr Leu His 
405 410 415 

Asp His Cys Asp Pro Lys lie lie His Arg Asp Val Lys Ala Ala Asn 
420 425 430 

He Leu Leu Asp Glu Glu Phe Glu Ala Val Val Gly Asp Hie Gly Leu 
435 440 445 

Ala Lys Leu Met Asp Tyr Lys Asp Thr His Val Thr Thr Ala Val Arg 
450 455 460 

Gly Thr He Gly His He Ala Pro Glu Tyr Leu Ser Thr Gly Lys Ser 
465 470 475 480 



Ser Glu Lys Thr Asp Val Hie Gly Tyr Gly He Met Leu Leu 
485 490 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEHESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA to iriRNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 142.. 795 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TCGACCCACG CX3TCCGTCCA ACTTCAATAA AGGGGAAACC AACGTAACCC TAATTTTGCT 

TTCTCCTCTT TGTICAGAAA ATTTTCCCTT TACTCTCAAA TTCCTTTTCG ATITCCCTCT 

CTTAAACCTC CGAAAGCTCA C ATG GOG TCT OGA AAC TAT OGG TOG GAG CTC 

Met Ala Ser Arg Asn Tyr Arg Trp Glu Leu 
15 10 

TTC GCA GCT TOG TEA AOC CEA ACC TEA GOT TTG ATT CAC CTG GTC GAA 
Phe Ala Ala Ser Leu Thr Leu Thr Leu Ala Leu lie His Leu Val Glu 
15 20 25 



GCA AAC TCC GAA QGA GAT GCT CTC TAC GCT CTT CQC OGG ACT 
Ala Asn Ser Glu Gly Asp Ala Leu Tyr Ala Leu Arg Arg Ser 
30 35 40 



TIG ACA 
Leu Ttor 
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GAT CCA GAC CAT GTC CTC CAG AGC TOG GAT CCA. ACT CTT GTT AAT CCT 315 
Asp Pro Asp His Val Leu Gin Ser Trp Asp Pro Thr Leu Val Asn Pro 
45 50 55 

TOT ACC TOG TTC CAT GTC ADC TGT AAC CAA GAC AAC CQC GTC ACT CGT 363 
Cys Thr Trp Phe His Val Thr Cys Asn Gin Asp Asn Arg Val Thr Arg 

60 65 70 

GTG GAT TIG GGA AAT TCA AAC CTC TCT QGA CAT CTT GOG OCT GAG CTT 411 
Val Asp Leu Gly Asn Ser Asn Leu Ser Gly His Leu Ala Pro Glu Leu 
75 80 85 90 

GGG AAG CTT GAA CAT TTA CAG TAT CTA GAG CTC TAC AAA AAC AAC ATC 459 
Gly Lys Leu Glu His Leu Gin Tyr Leu Glu Leu Tyr Lys Asn Asn lie 
95 100 105 

CAA GGA ACT ATA OCT TCC GAA CTT GGA AAT CTG AAG AAT CTC ATC AGC 507 
Gin Gly Thr lie Pro Ser Glu Leu Gly Asn Leu Lys Asn Leu lie Ser 
110 115 120 

TTG GAT CTG TAC AAC AAC AAT CTT ACA GGG ATA GTT COC ACT TIC TIG 555 
Leu Asp Leu Tyr Asn: Asn Asn Leu Thr Gly lie Val Pro Thr Phe Leu 
125 130 135 

GGA AAA TTG AAG TCT CTG GTC TTT TEA GGG CTT AAT GAC AAC OGA TTC 603 
Gly Lys Leu Lys Ser Leu Val Phe Leu Arg Leu Asn Asp Asn Arg Leu 
140 145 150 

ACC GOT CCA ATC CTA GAG CAC TCA COG CAA TCC CAA GOC TTT AAA GTT 651 
Thr Gly Pro lie Leu Glu His Ser Arg Gin Ser Gin Ala Phe Lys Val 
155 160 165 170 

GTT GAC GTC TCA AGC AAT GAT TTG TCT GGG ACA ATC OCA ACA AAC GGA 699 
Val Asp Val Ser Ser Asn Asp Leu Cys Gly Thr lie Pro Thr Asn Gly 
175 180 185 

COC TTT GOT CAC ATT OCT TEA CAG AAC TTT GAG AAC AAC COG AGA TTG 747 
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Pro Phe Ala His He Pro Leu Gin Asn Phe Glu Asn Asn Pro Arg Leu 
190 195 200 

GAG GGA COG GAA TEA CTC GGT CTT GCA AGC TAC GAC ACT AAC TGC ACC 795 
Glu Gly Pro Glu Leu Leu Gly Leu Ala Ser Tyr Asp Thr Asn Cys Ihr 
205 210 .215 

TCAAACAACT GGCAAAACCT GAAAATGAAG AATIQGGQGG TGACCITCEA AGAACACITC 855 

ACCACTTEAT CAAATATCAC A3CTATEATG TAATAAGTAT ATATATGTEAG TAAAAACAAA 915 

AAAAATCAAG AATCGAATCG GTEAATATCAT CTQGTCTCAA TIGAGAACTT CGAGGICTGT 975 

3 ATGTAAAATT TCTAAATGCG ATTTICGCTT ACTGTEAATGT TCGGTrGTOG GATIdGAGA 1035 

B AGrTAACATIT GTATTGGTAT GGTATCAAGT TGTTCTGCCT TGTCTGCAAA AAAAAAAAAA 1095 

AAAAAAAAAA A I 106 



(2) INFORMATICS PQR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Ser Arg Asn Tyr Arg Trp Glu Leu Phe Ala Ala Ser Leu Thr 
15 10 15 

Leu Thr Leu Ala Leu He His Leu Val Glu Ala Asn Ser Glu Gly Asp 
20 25 30 
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Ala Leu Tyr Ala Leu Arg Arg Ser Leu Thr Asp Pro Asp His Val Leu 
35 40 45 

Gin Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Iter Trp Phe His Val 
50 55 60 

Thr Cys Asn Gin Asp Asn Arg Val Thr Arg Val Asp Leu Gly Asn Ser 
65 70 75 80 

Asn Leu Ser Gly His Leu Ala Pro Glu Leu Gly Lys Leu Glu His Leu 
85 90 95 

Gin Tyr Leu Glu Leu Tyr Lys Asn Asn lie Gin Gly Thr lie Pro Ser 
100 105 110 



Glu Leu Gly Asn Leu Lys Asn Leu lie Ser Leu Asp Leu Tyr Asn Asn 
115 120 125 

Asn Leu Thr Gly lie Val Pro Thr Phe Leu Gly Lys Leu Lys Ser Leu 
130 135 140 

Val Phe Leu Arg Leu Asn Asp Asn Arg Leu Thr Gly Pro lie Leu Glu 
145 150 155 160 

His Ser Arg Gin Ser Gin Ala Phe Lys Val Val Asp Val Ser Ser Asn 
165 170 175 

Asp Leu Cys Gly Thr lie Pro Thr Asn Gly Pro Phe Ala His lie Pro 
180 185 190 

Leu Gin Asn Phe Glu Asn Asn Pro Arg Leu Glu Gly Pro Glu Leu Leu 
195 200 205 



Gly Leu Ala Ser Tyr Asp Thr Asn Cys Thr 
210 215 

(2) INFORMATION FOR SEQ ID NO: 24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGEH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MDLBCULE TYPE: cENA to iriRNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATORE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 757 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AGTGTGACTA ATTTA LTITIU VVVLUlUUrC TTTCTICAGA AAATTITCCC TITACTCTCA 60 

AATICCTTTT CGATTTCCCT CIXTITAAACC TCCGAAAGCT CAC ATG GOG TCT CGA 115 

Met Ala Ser Arg 
1 

AAC TAT CGG TOG GAG CTC TIC GCA GCT TOG TTA ACC CTA ACC TTA GCT 163 
Asn Tyr Arg Tip Glu Leu Phe Ala Ala Ser Leu Thr Leu Thr Leu Ala 
5 10 15 20 

TTG ATT CAC CIG GTC GAA GCA AAC TCC GAA GGA GAT GCT CTC TAC GCT 211 
Leu lie His Leu Val Glu Ala Asn Ser Glu Gly Asp Ala Leu Tyr Ala 
25 30 35 

CTT CGC CGG ACT TTG ACA GAT CCA GAC CAT CTC CTC CAG AGC TOG GAT 259 
Leu Arg Arg Ser Leu Thr Asp Pro Asp His Val Leu Gin Ser Trp Asp 
40 45 50 
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CCA ACT CTT GIT AAT cct TOT ACC TOG TIC CAT GTTC ACC TCT AAC CAA 
Pro Thr Leu Val Asn Pro Cys Thr Trp Hie His Val Thr Cys Asn Gin 
55 60 65 



307 



GAC AAC CGC GTC ACT CCT GTG GAT TTG GGA AAT TCA AAC C1C TCT GGA 
Asp Asn Arg Val Thr Arg Val Asp Leu Gly Asn Ser Asn Leu Ser Gly 
70 75 80 



355 



CAT CTT GCG CCT GAG CTT GGG AAG CTT GAA CAT TEA CAG TAT CTA GAG 
His Leu Ala Pro Glu Leu Gly Lys Leu Glu His Leu Gin Tyr Leu Glu 
85 90 95 100 



403 



CIC TAC AAA AAC AAC ATC CAA GGA ACT ATA OCT TCC GAA CTT GGA AAT 
Leu Tyr Lys Asn Asn lie Gin Gly Thr lie Pro Ser Glu Leu Gly Asn 
105 110 115 



451 



CTG AAG AAT CTC ATC AGC TTG GAT CTG TAC AAC AAC AAT CTT ACA GGG 
Leu Lys Asn Leu lie Ser Leu Asp Leu Tyr Asn Asn Asn Leu Thr Gly 
120 125 130 



499 



ATA CTT CCC ACT TCT TTG GGA AAA TIG AAG TCT CTG GTC ITT TEA OGG 
lie Val Pro Thr Ser Leu Gly Lys Leu Lys Ser Leu Val Phe Leu Arg 
135 140 145 



547 



CTT AAT GAC AAC CGA TTG ACC GGT CCA ATC OCT AGA GCA CTC ACG GCA 
Leu Asn Asp Asn Arg Leu Thr Gly Pro lie Pro Arg Ala Leu Thr Ala 
150 155 160 



595 



ATC CCA AGC CTT AAA GTT GTT GAC GTC TCA AGC AAT GAT TTG TGT GGA 
He Pro Ser Leu Lys Val Val Asp Val Ser Ser Asn Asp Leu Cys Gly 
165 170 175 180 



643 



ACA ATC CCA ACA AAC GGA CCC TTT GCT CAC ATT CCT TEA CAG AAC TTT 
Thr He Pro Thr Asn Gly Pro Phe Ala His He Pro Leu Gin Asn Phe 
185 190 195 



691 



GAG AAC AAC COG AGA TTG GAG GGA CCG GAA TEA CTC GCT CTT GCA AGC 



739 
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Glu Asn Asn Pro Arg Leu Glu Gly Pro Glu Leu Leu Gly Leu Ala Ser 
200 205 210 

TAC GAC ACT AAC TGC ACC TGAAACAACT GGCAAAACCT GAAAATGAAG 787 
Tyr Asp Tbr Asn Cys Thr 
215 

AATTGQGGGG TGACCTTGTA AGAACACITC ACCACTTTAT CAAATA3CAC ATCTA1TATG 847 

TAATAACTAT AIATATCTAG TAAAAACAAA AAAAATGAAG AATOGAATCG GTTAATATCAT 907 

CTOGTICrCAA TTGAGAACIT OGAGG7ICICT ATGTAAAATT TCTAAATGOG AUTTTOGOCT 967 

AAATTACTCA CACT 981 

(2) INFORMATION PCJR SBQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SBQ ID ND: 25: 



Met Ala Ser Arg Asn Tyr Arg Trp Glu Leu Phe Ala Ala Ser Leu Ttor 
1 5 10 15 

Leu Thr Leu Ala Leu lie His Leu Val Glu Ala Asn Ser Glu Gly Asp 
20 25 30 

Ala Leu Tyr Ala Leu Arg Arg Ser. Leu Ttor Asp Pro Asp His Val Leu 
35 40 45 

Gin Ser Trp Asp Pro Ttor Leu Val Asn Pro Cys Tbr Trp Phe His Val 
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50 55 60 

Hhr Cys Asn Gin Asp Asn Arg Val Thr Arg Val Asp Leu Gly Asn Ser 

65 70 75 80 

Asn Leu Ser Gly His Leu Ala Pro Glu Leu Gly Lys Leu Glu His Leu 
85 90 95 

Gin Tyr Leu Glu Leu Tyr Lys Asn Asn lie Gin Gly Thr He Pro Ser 
100 105 HO 

Glu Leu Gly Asn Leu Lys Asn Leu He Ser Leu Asp Leu Tyr Asn Asn 
115 120 125 

Asn Leu Thr Gly He Val Pro Thr Ser Leu Gly Lys Leu Lys Ser Leu 
130 135 140 

Val Phe Leu Arg Leu Asn Asp Asn Arg Leu Thr Gly Pro He Pro Arg 
145 150 155 160 

Ala Leu Thr Ala He Pro Ser Leu Lys Val Val Asp Val Ser Ser Asn 
165 170 175 

Asp Leu Cys Gly Thr He Pro Thr Asn Gly Pro Phe Ala His He Pro 
180 185 190 

Leu Gin Asn Phe Glu Asn Asn Pro Arg Leu Glu Gly Pro Glu Leu Leu 
195 200 205 

Gly Leu Ala Ser Tyr Asp Thr Asn Cys Thr 
210 215 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDETMESS : single 

(D) TOPOLOGY: linear 

(ii) MDLBCULE TYPE: cENA to xriRNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 661 



(xi) SEQUENCE DESCRIPTION: SEQ ID ND: 26: 

T CGA CCC ACG OCT COG CGA AAC TAT COG TGG GAG CTC TTC GCA GCT 
Arg Pro Tfor Arg Pro Arg Asn Tyr Arg Trp Glu Leu Phe Ala Ala 
15 10 15 

TCG TEA ATC CTA ACC TEA GCT TIG ATT CAC CTC GTC GAA GCA AAC TCC 
Ser Leu lie Leu Thr Leu Ala Leu lie His Leu Val Glu Ala Asn Ser 
20 25 30 

GAA QGA GAT GCT CTT TAC GCT CTT CGC COG ACT TEA ACA GAT CCG GAC 
Glu Gly Asp Ala Leu Tyr Ala Leu Arg Arg Ser Leu Thr Asp Pro Asp 
35 40 45 

CAT CTT CTC CAG AGO TGG GAT CCA ACT CTT GIT AAT OCT T3T ACC TOG 
His Val Leu Gin Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Thr Trp 
50 55 60 

TIC CAT CTC ACC TCT AAC CAA GAC AAC CGC GTC ACT OCT CTG GAT TTG 
Phe His Val Thr Cys Asn Gin Asp Asn Arg Val Thr Arg Val Asp Leu 
65 70 75 



GOG AAT TCA AAC CTC TCT QGA CAT CTT GOG CCT GAG CTT GGG AAG CTT 
Gly Asn Ser Asn Leu Ser Gly His Leu Ala Pro Glu Leu Gly Lys Leu 
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80 



85 90 95 



GAA CAT TEA CAG TAT CTA GAG CTC TAG AAA AAC AAC ATC CAA GGA ACT 334 
Glu His Leu Gin Tyr Leu Glu Leu Tyr Lys Asn Asn He Gin Gly Thr 
100 105 HO 

ATA CCT TCC GAA CTT GGA AAT CTG AAG AAT CTC ATC AGC TIG GAT CTG 382 
He Pro Ser Glu Leu Gly Asn Leu Lys Asn Leu He Ser Leu Asp Leu 
115 120 125 



430 



TAC AAC AAC AAT CTT ACA GGG ATA GOT CCC ACT TCT TIG GGA AAA TIG 
Tyr Asn Asn Asn Leu Thr Gly He Val Pro Thr Ser Leu Gly Lys Leu 
130 135 140 

AAG TCT CTG GOC TTT TEA GGG CTT AAT GAC AAC OGA TIG ACG GGG CCA 478 
Lys Ser Leu Val Phe Leu Arg Leu Asn Asp Asn Arg Leu Thr Gly Pro 
145 150 155 

ATC CCT AGA GCA CTC ACT GCA ATC CCA AGC CTT AAA GOT GOT GAT GTIC 526 
He Pro Arg Ala Leu Thr Ala He Pro Ser Leu Lys Val Val Asp Val 
160 165 170 175 

TCA AGC AAT GAT TIG TGT GGA ACA ATC CCA ACA AAC GGA CCT TTT GCT 574 
Ser Ser Asn Asp Leu Cys Gly Thr He Pro Thr Asn Gly Pro Phe Ala 
180 185 190 

CAC ATT CCT TEA CAG AAC TIT GAG AAC AAC COG AGG TIG GAG GGA COG 622 
His He Pro Leu Gin Asn Phe Glu Asn Asn Pro Arg Leu Glu Gly Pro 
195 200 205 

GAA TEA CTC GGT CTT GCA AGC TAC GAC ACT AAC TGC ACC TGAAAAAATT 671 
Glu Leu Leu Gly Leu Ala Ser Tyr Asp Thr Asn Cys Ihr 
210 215 220 

GGCAAAAOCT GAAAATGAAG AATTGGGGGG TGACCTTGTEA AGAACACTTC AOCACTTEAT 731 



CAAATATCAC ATCTACTA3G TAATAAG7EAT ATATATGTAG TCCAAAAAAA AAAAAAAA 



789 
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(2) INFORMATION FOR SBQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 27: 

Arg Pro Thr Arg Pro Arg Asn Tyr Arg Trp Glu Leu Phe Ala Ala Ser 
1 5 10 15 

Leu He Leu Thr Leu Ala Leu He His Leu Val Glu Ala Asn Ser Glu 
20 25 30 

Gly Asp Ala Leu Tyr Ala Leu Arg Arg Ser Leu Thr Asp Pro Asp His 
35 40 45 

Val Leu Gin Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Thr Trp Phe 
50 55 60 

His Val Thr Cys Asn Gin Asp Asn Arg Val Thr Arg Val Asp Leu Gly 
65 70 75 80 

Asn Ser Asn Leu Ser Gly His Leu Ala Pro Glu Leu Gly Lys Leu Glu 
85 90 95 

His Leu Gin Tyr Leu Glu Leu Tyr Lys Asn Asn He Gin Gly Thr He 
100 105 110 

Pro Ser Glu Leu Gly Asn Leu Lys Asn Leu He Ser Leu Asp Leu Tyr 
115 120 125 
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Asn Asn Asn Leu Thr Gly He Val Pro Tbr Ser Leu Gly Lys Leu Lys 
130 135 140 

Ser Leu Val Phe Leu Arg Leu Asn Asp Asn Arg Leu Tfar Gly Pro He 
145 150 155 160 

Pro Arg Ala Leu Thr Ala He Pro Ser Leu Lys Val Val Asp Val Ser 
165 170 175 

Ser Asn Asp Leu Cys Gly Tfcr He Pro Thr Asn Gly Pro Phe Ala His 
180 185 190 

He Pro Leu Gin Asn Phe Glu Asn Asn Pro Arg Leu Glu Gly Pro Glu 
195 200 205 

Leu Leu Gly Leu Ala Ser Tyr Asp Ibr Asn Cys Thr 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 b ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA to iriRNA 
(iii) HYPOTHETICAL: NO 



<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..675 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
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GGA OOG ATT CAA GOC TOC GAA GQG GAC OCT CTT CAC GOG CTT OGC OQG 
Gly Pro lie Gin Ala Ser Glu Gly Asp Ala Leu His Ala Leu Aug Arg 
! 5 10 15 

AGC TTA TCA GAT CCA GAC AAT GTT CTT GAG ACT TOG GAT OCA ACT CTT 
Ser Leu Ser Asp Pro Asp Asn Val Val Gin Ser Trp Asp Pro Thr Leu 
20 25 30 



GTT AAT COT TGT ACT TOG TIT CAT CTC ACT TCT AAT CAA CAC CAT CAA 
Val Asn Pro Cys Thr Trp Phe His Val Thr Cys Asn Gin His His Gin 
35 40 45 



GTC ACT COT CTG GAT TTG GGG AAT TCA AAC TEA TCT GGA CAT CTA G7IA 
Val Thr Arg Leu Asp Leu Gly Asn Ser Asn Leu Ser Gly His Leu Val 
50 55 60 

CCT GAA CTT GGG AAG CTT GAA CAT TTA CAA TAT CTG TAT GGA ATC ATC 
Pro Glu Leu Gly Lys Leu Glu His Leu Gin Tyr Leu Tyr Gly lie lie 
65 70 75 80 



ACT CTT TTG CCT TTT GAT TAT CTG AAA ACA TTT ACA TTA TCA GTC ACA 
Thr Leu Leu Pro Phe Asp Tyr Leu Lys Thr Phe Thr Leu Ser Val Thr 
85 90 95 



CAT ATA ACA TTT TQC TTT GAG TCA TAT ACT GAA CTC TAC AAA AAC GAG 
His lie Thr Phe Cys Phe Glu Ser Tyr Ser Glu Leu Tyr Lys Asn Glu 
100 105 110 

ATT CAA GGA ACT ATA OCT TCT GAG CTT GGA AAT CTG AAG ACT CTA ATC 
lie Gin Gly Thr lie Pro Ser Glu Leu Gly Asn Leu Lys Ser Leu lie 
115 120 125 



ACT TTG GAT CTG TAC AAC AAC AAT CTC AGC GGG AAA ATC CCA TCT TCT 
Ser Leu Asp Leu Tyr Asn Asn Asn Leu Thr Gly Lys lie Pro Ser Ser 
130 135 140 
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TIG GGA AAA TIG AAG TCA CTT GOT TIT TIG COG CTT AAC GAA AAC CGA 480 
Leu Gly Lys Leu Lys Ser Leu Val Phe Leu Arg Leu Asn Glu Asn Arg 
145 150 155 160 

TIG ACC GGT CCT ATT CCT AGA GAA CTC ACA GOT ATT TCA AGC CTT AAA 528 
Leu Thr Gly Pro He Pro Arg Glu Leu Thr Val lie Ser Ser Leu Lys 
165 170 175 

GOT GOT GAT GTC TCA GOG AAT GAT TIG TOT GGA ACA ATT OCA GTA GAA 576 
Val Val Asp Val Ser Gly Asn Asp Leu Cys Gly Thr He Pro Val Glu 
180 185 190 

GGA CCT TOT GAA CAC ATT CCT AUG CAA AAC TIT GAG AAC AAC CIG AGA 624 
Gly Pro Phe Glu His He Pro Met Gin Asn Phe Glu Asn Asn Leu Arg 
195 200 205 

TIG GAG GGA CCA GAA CTA CEA GGT COT GCG AGC TAT GAC ACC AAT TGC 672 
Leu Glu Gly Pro Glu Leu Leu Gly Leu Ala Ser Tyr Asp Thr Asn Cys 
210 215 220 

ACT TAAAAAGAAG TIGAAGAAOC TATAAAGAAG AATGOTAGGT GACCTTGTAA 725 

Thr 
225 

GAACICIGTA OCAAGTGOTT GTIAAATCTAT ATAGAGCCTT GTOTCATGOT ATATATGAAA 785 

GCTTIGAGAG ACAGTAACOT GCAATGTTATT GGTATTGGTA GAAAAAGOTG AAATGAGAAT 845 

TGCTOTGEAA TIGGATTICT GTOTCOTATG TAACTTGAAT TTCOTATTA 894 

(2) INPORMATTCN FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TOPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID ND: 29: 

Gly Pro He Gin Ala Ser Glu Gly Asp Ala Leu His Ala Leu Arg Arg 
1 5 10 15 

Ser Leu Ser Asp Pro Asp Asn Val Val Gin Ser Trp Asp Pro Thr Leu 
20 25 30 

Val Asn Pro Cys Thr Trp Phe His Val Itor Cys Asn Gin His His Gin 
35 40 45 

Val Thr Arg Leu Asp Leu Gly Asn Ser Asn Leu Ser Gly His Leu Val 
50 55 60 

Pro Glu Leu Gly Lys Leu Glu His Leu Gin Tyr Leu Tyr Gly He He 
65 70 75 80 

Thr Leu Leu Pro Phe Asp Tyr Leu Lys Tbr Phe Thr Leu Ser Val Thr 
85 90 95 

His He Thr Phe Cys Phe Glu Ser Tyr Ser Glu Leu Tyr Lys Asn Glu 
100 105 110 

He Gin Gly Thr lie Pro Ser Glu Leu Gly Asn Leu Lys Ser Leu He 
115 120 125 

Ser Leu Asp Leu Tyr Asn Asn Asn Leu Thr Gly Lys He Pro Ser Ser 
130 135 140 

Leu Gly Lys Leu Lys Ser Leu Val Phe Leu Arg Leu Asn Glu Asn Arg 
145 150 155 160 

Leu Thr Gly Pro He Pro Arg Glu Leu Thr Val He Ser Ser Leu Lys 
165 170 175 
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Val Val Asp Val Ser Gly Asn Asp Leu Cys Gly Dir lie Pro Val Glu 
180 185 190 

Gly Pro Phe Glu His lie Pro Met Gin Asn Phe Glu Asn Asn Leu Arg 
195 200 205 

Leu Glu Gly Pro Glu Leu Leu Gly Leu Ala Ser Tyr Asp Tbr Asn Cys 
210 215 220 

Thr 
225 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1063 base pairs 

(B) TOPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA to iriRNA 
(iii) HYPOTHETICAL: NO 



(ix) FEA3URE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 106. .759 



(xi) SEQUENCE DESCRIPTION: SEC 3D NO: 30: 
TCGACCCACG OGICOGADGA AACCCTAATT TIGCTTOCTC ATCTTGTTCA GAAAATTACT 60 



CAAATICCTA TTAGATTACT CTCICTIOGA CCICCGATAG CTCAC AUG GOG TCT 

Met Ala Ser 



114 
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1 



CGA AAC TAT CGG TOG GAG CTC TTC GCA- GCT TOG TTA ATC CTA ACC TEA 
Arg Asn Tyr Arg Trp Glu Leu Phe Ala Ala Ser Leu He Leu Thr Leu 



10 



15 



162 



GCT TIG ATT CAC CTG GTC GAA GCA AAC TCC GAA GGA GAT GCT CTT TAC 
Ala Leu He His Leu Val Glu Ala Asn Ser Glu Gly Asp Ala Leu Tyr 
20 25 30 35 



210 



GCT CTT CGC CGG ACT TTA ACA GAT COG GAC CAT GIT CTC CAG AGC TOG 
Ala Leu Arg Arg Ser Leu Thr Asp Pro Asp His Val Leu Gin Ser Trp 
40 45 50 



258 



GAT CCA ACT CTT GTT AAT CCT TGT ACC TOG TTC CAT GTC ACC TGT AAC 
Asp Pro Thr Leu Val Asn Pro Cys Thr Trp Hie His Val Thr Cys Asn 
55 60 65 



306 



CAA GAC AAC CGC GTC ACT CGT GTC GAT TIG GGG AAT TCA AAC CTC TCT 
Gin Asp Asn Arg Val Thr Arg Val Asp Leu Gly Asn Ser Asn Leu Ser 
70 75 80 



354 



GGA CAT CTT GOG OCT GAG CTT GGG AAG CTT GAA CAT TTA CAG TAT CTA 402 
Gly His Leu Ala Pro Glu Leu Gly Lys Leu Glu His Leu Gin Tyr Leu 
85 90 95 

GAG CTC TAC AAA AAC AAC ATC CAA GGA ACT ATA CCT TCC GAA CTT GGA 450 
Glu Leu Tyr Lys Asn Asn He Gin Gly Thr He Pro Ser Glu Leu Gly 
100 105 110 115 

AAT CTG AAG AAT CTC ATC AGC TTG GAT CTG TAC AAC AAC AAT CTT ACA 498 
Asn Leu Lys Asn Leu He Ser Leu Asp Leu Tyr Asn Asn Asn Leu Thr 
120 125 130 



GGG ATA GTT CCC ACT TCT TTG GGA AAA TIG AAG TCT CTG GTC TIT TTA 
Gly He Val Pro Thr Ser Leu Gly Lys Leu Lys Ser Leu Val Phe Leu 
135 140 145 



546 
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CQG CTT AAT GAC AAC CGA TTO ACG QQG CCA ATC CCT AGA GCA CTC ACT 594 
Arg Leu Asn Asp Asn Arg Leu Tfar Gly Pro lie Pro Arg Ala Leu Thr 
150 155 160 

GCA ATC CCA AGC CTT AAA CTT GIT GAT GTC TCA AGC AAT GAT TIG TGT 642 
Ala lie Pro Ser Leu Lys Val Val Asp Val Ser Ser Asn Asp Leu Cys 
165 170 175 



: rl=? 



GGA ACA ATC CCA ACA AAC QGA OCT TTT GCT CAC ATT CCT TEA CAG AAC 690 
Gly Thr lie Pro Thr Asn Gly Pro Phe Ala His lie Pro Leu Gin Asn 
180 185 190 195 

TTT GAG AAC AAC COG AGG TIG GAG GGA CCG GAA TEA CTC GGT CTT GCA 738 
Phe Glu Asn Asn Pro Arg Leu Glu Gly Pro Glu Leu Leu Gly Leu Ala 
200 205 210 

AGC TAC GAC ACT AAC TQC ACC TGAAAAAATT GQCAAAACCT GAAAATGAAG 789 
Ser Tyr Asp Thr Asn Cys Thr 
215 

AATTQGGGGG TGACCITGTA AGAACACTTC ACCACTTEAT CAAATATCAC ATCTACTATG 849 

TAATAAGTAT ATATATGTAG TCCAAAAAAA AAATGAAGAA TCGAATCAGT AATATCATCT 909 

GGTCTCAATT GAGAACITTG AGGICTGTGT ATGTAAAATT TCTAAATGCG ACTTTCGOGT 969 

ACTGTAATGT TOGGTTGTGG GATTCTGAGA AGTAACATTT CTAITGCTAT GGTATCAAGT 1029 

'ICTICIGOCT TGTCTGCAAA AAAAAAAAAA AAAA 1063 

(2) INFORMATION FOR SBQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ala Ser Arg Asn Tyr Arg Trp Glu Leu Phe Ala Ala Ser I^u lie 
1 5 10 15 

Leu Thr Leu Ala Leu lie His Leu Val Glu Ala Asn Ser Glu Gly Asp 
20 25 30 

Ala Leu Tyr Ala Leu Arg Arg Ser Leu Thr Asp Pro Asp His Val Leu 
35 40 45 

Gin Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Thr Trp Phe His Val 
50 55 60 

Thr Cys Asn Gin Asp Asn Arg Val Thr Arg Val Asp Leu Gly Asn Ser 
65 70 75 80 

Asn Leu Ser Gly His Leu Ala Pro Glu Leu Gly Lys Leu Glu His Leu 
85 90 95 

Gin Tyr Leu Glu Leu Tyr Lys Asn Asn He Gin Gly Thr He Pro Ser 
100 105 110 

Glu Leu Gly Asn Leu Lys Asn Leu He Ser Leu Asp Leu Tyr Asn Asn 
115 120 125 

Asn Leu Thr Gly lie Val Pro Thr Ser Leu Gly Lys Leu Lys Ser Leu 
130 135 140 

Val Phe Leu Arg Leu Asn Asp Asn Arg Leu Thr Gly Pro He Pro Arg 
145 150 155 160 

Ala Leu Thr Ala He Pro Ser Leu Lys Val Val Asp Val Ser Ser Asn 
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165 170 175 

Asp Leu Cys Gly Ttor lie Pro Thr Asn Gly Pro Pfae Ala His lie Pro 
180 185 190 

Leu Gin Asn Phe Glu Asn Asn Pro Arg Leu Glu Gly Pro Glu Leu Leu 
195 200 205 

Gly Leu Ala Ser Tyr Asp Thr Asn Cys Thr 
210 215 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : double 

(D) TOPOLOGY: linear 

(ii) MDLBCULE TYPE: cENA to iriRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SERK gene cENA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 195.. 2069 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 32: 
GGAITnTAT TITAimTIT ACTCTTTGTT TGTTTTEAATG CTAATGQGIT TTEAAAAGGG 
TIATCGAAAA AATGACTGAG TTTCTGITGA GGTICTCTCT GEAAAGTGTT AAIGGTGGTG 120 
AITTICGGAA GITAGGGTTr TCTCGGATCT GAAGAGATCA AATCAAGATT OGAAA3TEAC 180 

catigtictt tgaa atg gag tog act tat gig gig tit atc tea err tca 230 

Met Glu Ser Ser Tyr Val Val Phe lie Leu Leu Ser 
15 10 

CTG ATC TTA CTT COG AAT CAT TCA CIG TOG CTT GCT TCT GCT AAT TTG 278 
Leu lie Leu Leu Pro Asn His Ser Leu Trp Leu Ala Ser Ala Asn Leu 
15 20 25 

GAA GCT GAT GCT TTG CAT ACT TTG AGG GIT ACT CEA GIT GAT CCA AAC 326 
Glu Gly Asp Ala Leu His Thr Leu Arg Val Thr Leu Val Asp Pro Asn 
30 35 40 

AAT GTC TTG GAG AGC TGG GAT CCT ACG CEA CTG AAT CCT TQC ACA TQG 374 
Asn Val Leu Gin Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Thr Trp 
45 50 55 60 

TIC CAT GTC ACT TGC AAC AAC GAG AAC ACT GTC ATA AGA CTT GAT TTG 422 
Phe His Val Tbr Cys Asn Asn Glu Asn Ser Val lie Arg Val Asp Leu 
65 70 75 

GGG AAT GCA GAG TTA TCT GGC CAT TEA GIT CCA GAG CTT GCT GTG CIC 470 
Gly Asn Ala Glu Leu Ser Gly His Leu Val Pro Glu Leu Gly Val Leu 
80 85 90 

AAG AAT TTG CAG TAT TTG GAG CTT TAC ACT AAC AAC ATA ACT GGC COG 518 
Lys Asn Leu Gin Tyr Leu Glu Leu Tyr Ser Asn Asn He Thr Gly Pro 
95 100 105 

ATT CCT ACT AAT CTT GGA AAT CTG ACA AAC TEA GIG ACT TTG GAT CTT 566 
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Ile Pro Ser Asn Leu Gly Asn . Leu Thr Asn Leu Val Ser I^u Asp Leu 
110 115 120 

TAC TEA AAC AGC TTC TCC GCT OCT ATT COG GAA TCA TTG GGA AAG CTT 
Tyr Leu Asn Ser Phe Ser Gly Pro lie Pro Glu Ser Leu Gly Lys Leu 
125 130 135 140 



614 



TCA AAG CTG AGA ITT CTC CGG CTT AAC AAC AAC ACT CTC ACT GGG TCA 
Ser Lys Leu Arg Phe Leu Arg Leu Asn Asn Asn Ser Leu Thr Gly Ser 
145 150 155 



662 



ATT CCT ATG TCA CTG ACC AAT AIT ACT ACC CTT CAA CTG TEA GAT CTA 
lie Pro Met Ser Leu Thr Asn lie Thr Thr Leu Gin Val Leu Asp Leu 
160 165 170 



710 



TCA AAT AAC AGA CTC TCT GCT TCA GTT OCT GAC AAT GGC TCC TIC TCA 
Ser Asn Asn Arg Leu Ser Gly Ser Val Pro Asp Asn Gly Ser Phe Ser 
175 180 185 



758 



CTC TTC ACA CCC ATC ACT TTT GCT AAT AAC TEA GAC CTA TGT GGA CCT 
Leu Phe Ihr Pro lie Ser Phe Ala Asn Asn Leu Asp Leu Cys Gly Pro 
190 195 200 



806 



GOT ACA ACT CAC CCA TCT CCT GGA TCT CCC COG TTT TCT CCT CCA CCA 
Val Ihr Ser His Pro Cys Pro Gly Ser Pro Pro Phe Ser Pro Pro Pro 
205 210 215 220 



854 



CCT TTT ATT CAA CCT CCC CCA GTT TCC ACC CCG ACT GGG TAT GCT ATA 
Pro Phe lie Gin Pro Pro Pro Val Ser Thr Pro Ser Gly Tyr Gly lie 
225 230 235 



902 



ACT GGA GCA ATA GCT GCT GGA GTT GCT GGA GCT GCT GCT TTG CCC TTT 
Thr Gly Ala lie Ala Gly Gly Val Ala Ala Gly Ala Ala Leu Pro Phe 
240 245 250 



950 



GCT GCT CCT GCA ATA GCC TTT GCT TOG TOG OGA OGA AGA AGC CCA CTA 
Ala Ala Pro Ala lie Ala Hie Ala Trp Trp Arg Arg Arg Ser Pro Leu 



998 



255 



260 



265 



GAT AIT TTC TTC GAT GTC OCT GCC GAA GAA GAT CCA GAA GOT CAT CTG 
Asp lie Phe Phe Asp Val Pro Ala Glu Glu Asp Pro Glu Val His Leu 
270 275 280 

GGA CAG CTC AAG AGG TTT TCT TIG OGG GAG CTA CAA GTCG GOG ACT GAT 
Gly Gin Leu Lys Arg Phe Ser Leu Arg Glu Leu Gin Val Ala Ser Asp 
285 290 295 300 

GQG TTT ACT AAC AAG AAC ATT TIG GGC AGA GCT GOG TIT GGG AAA GTC 
Gly Phe Ser Asn Lys Asn lie Leu Gly Arg Gly Gly Phe Gly Lys Val 
305 310 315 

TAG AAG GGA CGC TTG GCA GAC GGA ACT CTT GIT GCT GTC AAG AGA CTG 
Tyr Lys Gly Arg Leu Ala Asp Gly Thr Leu Val Ala Val Lys Arg Leu 
320 325 330 

AAG GAA GAG OGA ACT CCA GCT GGA GAG CTC CAG TTT CAA ACA GAA CTA 
Lys Glu Glu Arg Thr Pro Gly Gly Glu Leu Gin Phe Gin Thr Glu Val 
335 340 345 

GAG ATG ATA ACT ATG GCA GOT CAT OGA AAC CTG TIG AGA TEA OGA GCT 
Glu Met lie Ser Met Ala Val His Arg Asn Leu Leu Arg Leu Arg Gly 
350 355 360 

TTC TCT ATG ACA COG ACC GAG AGA TTG CTT CTG TAT CCT TAC ATG GCC 
Phe Cys Met Thr Pro Thr Glu Arg Leu Leu Val Tyr Pro Tyr- Met Ala 
365 370 375 380 

AAT GGA ACT GTT GCT TOG TCT CTC AGA GAG AGG CCA COG TCA CAA OCT 
Asn Gly Ser Val Ala Ser Cys Leu Arg Glu Arg Pro Pro Ser Gin Pro 
385 390 395 



CCG CTT GAT TOG CCA ACG OGG AAG AGA ATC GOG CTA GGC TCA GCT OGA 
Pro Leu Asp Trp Pro Thr Arg Lys Arg lie Ala Leu Gly Ser Ala Arg 
400 405 410 
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.0 



GCT TTG TCT TAC CIA CAT GAT CAC TGC GAT COG AAG ATC ATT CAC OCT 1478 
Gly Leu Ser Tyr Leu His Asp His Cys Asp Pro Lys lie lie His Arg 
415 420 425 

t 

GAC GTA AAA GCA GCA AAC ATC CTC TTA GAC GAA OA TTC GAA GOG GOT 1526 
Asp Val Lys Ala Ala Asn lie Leu Leu Asp Glu Glu Phe Glu Ala Val 
430 435 440 

GET GGA GAT TTC* GGG TIG GCA AAG CTT ATG GAC TAT AAA GAC ACT CAC 1574 
Val Gly Asp Phe Gly Leu Ala Lys Leu Met Asp Tyr Lys Asp Thr His 
445 450 455 460 

GTC ACA ACA GCA GTC OCT GGC ACC ATC GCT CAC ATC GCT CCA GAA TAT 1622 
Val Thr Thr Ala Val Arg Gly Thr lie Gly His lie Ala Pro Glu Tyr 
465 470 475 

CTC TCA ACC GGA AAA TCT TCA GAG AAA ACC GAC GTT TTC GGA TAC GGA 1670 
Leu Ser Thr Gly Lys Ser Ser Glu Lys Thr Asp Val Phe Gly Tyr Gly 
480 485 490 

ATC ATG CTT CTA GAA CIA ATC ACA GGA CAA AGA GCT TTC GAT CTC GCT 1718 
lie Met Leu Leu Glu Leu lie Thr Gly Gin Arg Ala Phe Asp Leu Ala 
495 500 505 

CGG CTA GCT AAC GAC GAC GAC GTC ATG TTA CTT GAC TOG GIG AAA GGA 1766 
Arg Leu Ala Asn Asp Asp Asp Val Met Leu Leu Asp Trp Val Lys Gly 
510 515 520 

TTG TTG AAG GAG AAG AAG CTA GAG ATG TTA CTG GAT CCA GAT CTT CAA i814 
Leu Leu Lys Glu Lys Lys Leu Glu Met Leu Val Asp Pro Asp Leu Gin 
525 530 535 540 

ACA AAC TAC GAG GAG AGA GAA CTG GAA CAA CTG ATA CAA GTG GOG TTG 1862 
Thr Asn Tyr Glu Glu Arg Glu Leu Glu Gin Val lie Gin Val Ala Leu 
545 550 555 



-95- 



CTA TGC AOG CAA GGA TCA CCA AUG GAA AGA OCA AAG ATG TCT GAA GIT 
Leu Cys Thr Gin Gly Ser Pro Met Glu Arg Pro Lys Met Ser Glu Val 
560 565 570 

CTA AGG ATG CTG GAA GGA GAT GOG CTT GCG GAG AAA TOG GAC GAA TGG 
Val Arg Met Leu Glu Gly Asp Gly Leu Ala Glu Lys Trp Asp Glu Trp 
575 580 585 

CAA AAA GIT GAG ATT TIG AGG GAA GAG ATT GAT TIG ACT CCT AAT CCT 
Gin Lys Val Glu lie Leu Arg Glu Glu lie Asp Leu Ser Pro Asn Pro 
590 595 600 

AAC TCT GAT TOG ATT CTT GAT TCT ACT TAC AAT TIG CAC GCC CTT GAG 
Asn Ser Asp Trp lie Leu Asp Ser Thr Tyr Asn Leu His Ala Val Glu 
605 610 615 620 

TTA TCT GGT CCA AGG TAAAAAAAAA AAAAAAAAAA 
Leu Ser Gly Pro Arg 
625 



(2) INFORMATION FOR SBQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTIH: 625 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE nESCRIFTICN: SBQ ID NO: 33: 

Met Glu Ser Ser Tyr Val Val Phe lie Leu Leu Ser Leu lie Leu Leu 
15 10 15 



Pro Asn His Ser Leu Trp Leu Ala Ser Ala Asn Leu Glu Gly Asp Ala 
20 25 30 
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Leu His Thr Leu Arg Val Thr Leu Val Asp Pro Asn Asn Val Leu Gin 
35 40 45 

Ser Trp Asp Pro Thr Leu Val Asn Pro Cys Thr Trp Phe His Val Thr 
50 55 60 

Cys Asn Asn Glu Asn Ser Val lie Arg Val Asp Leu Gly Asn Ala Glu 
65 70 75 80 

Leu Ser Gly His Leu Val Pro Glu Leu Gly Val Leu Lys Asn Leu Gin 
85 90 95 

Tyr Leu Glu Leu Tyr Ser Asn Asn lie Tbr Gly Pro lie Pro Ser Asn 
100 105 110 

Leu Gly Asn Leu Thr Asn Leu Val Ser Leu Asp Leu Tyr Leu Asn Ser 
115 120 125 

Phe Ser Gly Pro lie Pro Glu Ser Leu Gly Lys Leu Ser Lys Leu Arg 
130 135 140 

Phe Leu Arg Leu Asn Asn Asn Ser Leu Thr Gly Ser lie Pro Met Ser 
145 150 155 160 

Leu Thr Asn lie Thr Thr Leu Gin Veil Leu Asp Leu Ser Asn Asn Arg 
165 170 175 

Leu Ser Gly Ser Val Pro Asp Asn Gly Ser Hie Ser Leu Hie Thr Pro 
180 185 190 

lie Ser Phe Ala Asn Asn Leu Asp Leu Cys Gly Pro Val Thr Ser His 
195 200 205 



Pro Cys Pro Gly Ser Pro Pro Phe Ser Pro Pro Pro Pro Phe He Gin 
210 215 220 
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Pro Pro Pro Val Ser Thr Pro Ser Gly Tyr Gly He Thr Gly Ala lie 
225 230 235 240 

Ala Gly Gly Val Ala Ala Gly Ala Ala Leu Pro Phe Ala Ala Pro Ala 
245 250 255 

He Ala Phe Ala Trp Trp Arg Arg Arg Ser Pro Leu Asp He Phe Phe 
260 265 270 

Asp Val Pro Ala Glu Glu Asp Pro Glu Val His Leu Gly Gin Leu Lys 
275 280 285 

Arg Phe Ser Leu Arg Glu Leu Gin Val Ala Ser Asp Gly Phe Ser Asn 
290 295 300 

Lys Asn He Leu Gly Arg Gly Gly Phe Gly Lys Val Tyr Lys Gly Arg 
305 310 315 320 

Leu Ala Asp Gly Thr Leu Val Ala Val Lys Arg Leu Lys Glu Glu Arg 
325 330 335 

Thr Pro Gly Gly Glu Leu Gin Phe Gin Thr Glu Val Glu Met He Ser 
340 345 350 

Met Ala Val His Arg Asn Leu Leu Arg Leu Arg Gly Phe Cys Met Thr 
355 360 365 

Pro Thr Glu Arg Leu Leu Val Tyr Pro Tyr Met Ala Asn Gly Ser Val 
370 375 380 

Ala Ser Cys Leu Arg Glu Arg Pro Pro Ser Gin Pro . Pro Leu Asp Trp 
385 390 395 400 



Pro Thr Arg Lys Arg He Ala Leu Gly Ser Ala Arg Gly Leu Ser Tyr 
405 410 415 



Leu His Asp His Cys Asp Pro Lys He lie His Arg Asp Val Lys Ala 



98 



420 



425 



430 



Ala Asn lie Leu Leu Asp Glu Glu Phe Glu Ala Val Val Gly Asp Phe 
435 440 445 

Gly Leu Ala Lys Leu Met Asp Tyr Lys Asp Thr His Val Thr Thr Ala 
450 455 460 

Val Arg Gly Thr lie Gly His lie Ala Pro Glu Tyr Leu Ser Thr Gly 
465 470 475 480 

Lys Ser Ser Glu Lys Thr Asp Val Phe Gly Tyr Gly lie Met Leu Leu 
- 485 490 495 

Glu Leu lie Thr Gly Gin Arg Ala Phe Asp Leu Ala Arg Leu Ala Asn 
500 505 510 

Asp Asp Asp Val Met Leu Leu Asp Trp Val Lys Gly Leu Leu Lys Glu 
515 520 525 

Lys Lys Leu Glu Met Leu Val Asp Pro Asp Leu Gin Thr Asn Tyr Glu 
530 535 540 

Glu Arg Glu Leu Glu Gin Val lie Gin Val Ala Leu Leu Cys Thr Gin 
545 550 555 560 

Gly Ser Pro Met Glu Arg Pro Lys Met Ser Glu Val Val Arg Met Leu 
565 570 575 

Glu Gly Asp Gly Leu Ala Glu Lys Trp Asp Glu Trp Gin Lys Val Glu 
580 585 590 



lie Leu Arg Glu Glu lie Asp Leu Ser Pro Asn Pro Asn Ser Asp Trp 
595 600 605 



lie Leu Asp Ser Thr Tyr Asn Leu His Ala Val Glu Leu Ser Gly Pro 
610 615 620 



99 



Arg 
625 



